Unicode Planes
   HOME

TheInfoList



OR:

In the
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
standard, a plane is a continuous group of 65,536 (216)
code point In character encoding terminology, a code point, codepoint or code position is a numerical value that maps to a specific character. Code points usually represent a single grapheme—usually a letter, digit, punctuation mark, or whitespace—but ...
s. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position
hexadecimal In mathematics and computing, the hexadecimal (also base-16 or simply hex) numeral system is a positional numeral system that represents numbers using a radix (base) of 16. Unlike the decimal system representing numbers using 10 symbols, hexa ...
format (U+''hhhhhh''). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version , five of the planes have assigned code points (characters), and seven are named. The limit of 17 planes is due to
UTF-16 UTF-16 (16-bit computing, 16-bit Unicode Transformation Format) is a character encoding capable of encoding all 1,112,064 valid code points of Unicode (in fact this number of code points is dictated by the design of UTF-16). The encoding is variab ...
, which can encode 220 code points (16 planes) as pairs of
words A word is a basic element of language that carries an objective or practical meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consen ...
, plus the BMP as a single word.
UTF-8 UTF-8 is a variable-width encoding, variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode'' (or ''Universal Coded Character Set'') ''Transformation Format 8-bit'' ...
was designed with a much larger limit of 231 (2,147,483,648) code points (32,768 planes), and would still be able to encode 221 (2,097,152) code points (32 planes) even under the current limit of 4 bytes. The 17 planes can accommodate 1,114,112 code points. Of these, 2,048 are Universal Character Set characters#Surrogates, surrogates (used to make the pairs in UTF-16), 66 are Universal Character Set characters#Non-characters, non-characters, and 137,468 are Universal Character Set characters#Private-Use Characters, reserved for private use, leaving 974,530 for public assignment. Planes are further subdivided into Unicode blocks, which, unlike planes, do not have a fixed size. The 327 blocks defined in Unicode cover 26% of the possible code point space, and range in size from a minimum of 16 code points (sixteen blocks) to a maximum of 65,536 code points (Supplementary Private Use Area-A and -B, which constitute the entirety of planes 15 and 16). For future usage, ranges of characters have been tentatively mapped out for most known current and ancient writing systems.


Overview


Assigned characters


Basic Multilingual Plane

The first plane, plane 0, the Basic Multilingual Plane (BMP) contains characters for almost all modern languages, and a large number of Unicode symbols, symbols. A primary objective for the BMP is to support the unification of prior character sets as well as characters for writing. Most of the assigned code points in the BMP are used to encode Chinese, Japanese, and Korean (CJK characters, CJK) characters. The High Surrogate (U+D800–U+DBFF) and Low Surrogate (U+DC00–U+DFFF) codes are reserved for UTF-16#Code points from U+010000 to U+10FFFF, encoding non-BMP characters in UTF-16 by using a ''pair'' of 16-bit codes: one High Surrogate and one Low Surrogate. A single surrogate code point will never be assigned a character. 65,520 of the 65,536 code points in this plane have been allocated to a Unicode block, leaving just 16 code points in a single unallocated range (2FE0..2FEF). , the BMP comprises the following 164 blocks: * Basic Latin (Unicode block), Basic Latin (Lower half of ISO/IEC 8859-1: ISO/IEC 646:1991-IRV aka ASCII) (0000–007F) * Latin-1 Supplement (Upper half of ISO/IEC 8859-1) (0080–00FF) * Latin Extended-A (0100–017F) * Latin Extended-B (0180–024F) * IPA Extensions (0250–02AF) * Spacing Modifier Letters (02B0–02FF) * Combining Diacritical Marks (0300–036F) * Greek and Coptic (0370–03FF) * Cyrillic (Unicode block), Cyrillic (0400–04FF) * Cyrillic Supplement (0500–052F) * Armenian (Unicode block), Armenian (0530–058F) * Aramaic Scripts: ** Hebrew (Unicode block), Hebrew (0590–05FF) ** Arabic (Unicode block), Arabic (0600–06FF) ** Syriac (Unicode block), Syriac (0700–074F) ** Arabic Supplement (0750–077F) ** Thaana (Unicode block), Thaana (0780–07BF) ** NKo (Unicode block), N'Ko (07C0–07FF) ** Samaritan (Unicode block), Samaritan (0800–083F) ** Mandaic (Unicode block), Mandaic (0840–085F) ** Syriac Supplement (0860–086F) ** Arabic Extended-B (0870–089F) ** Arabic Extended-A (08A0–08FF) * Brahmic scripts, Brahmic scripts: ** Devanagari (Unicode block), Devanagari (0900–097F) ** Bengali (Unicode block), Bengali (0980–09FF) ** Gurmukhi (Unicode block), Gurmukhi (0A00–0A7F) ** Gujarati (Unicode block), Gujarati (0A80–0AFF) ** Oriya (Unicode block), Oriya (0B00–0B7F) ** Tamil (Unicode block), Tamil (0B80–0BFF) ** Telugu (Unicode block), Telugu (0C00–0C7F) ** Kannada (Unicode block), Kannada (0C80–0CFF) ** Malayalam (Unicode block), Malayalam (0D00–0D7F) ** Sinhala (Unicode block), Sinhala (0D80–0DFF) ** Thai (Unicode block), Thai (0E00–0E7F) ** Lao (Unicode block), Lao (0E80–0EFF) ** Tibetan (Unicode block), Tibetan (0F00–0FFF) ** Myanmar (Unicode block), Myanmar (1000–109F) * Georgian (Unicode block), Georgian (10A0–10FF) * Hangul Jamo (Unicode block), Hangul Jamo (1100–11FF) * Ethiopic (Unicode block), Ethiopic (1200–137F) * Ethiopic Supplement (1380–139F) * Cherokee (Unicode block), Cherokee (13A0–13FF) * Unified Canadian Aboriginal Syllabics (Unicode block), Unified Canadian Aboriginal Syllabics (1400–167F) * Ogham (Unicode block), Ogham (1680–169F) * Runic (Unicode block), Runic (16A0–16FF) * Philippine scripts: ** Tagalog (Unicode block), Tagalog (1700–171F) ** Hanunoo (Unicode block), Hanunoo (1720–173F) ** Buhid (Unicode block), Buhid (1740–175F) ** Tagbanwa (Unicode block), Tagbanwa (1760–177F) * Khmer (Unicode block), Khmer (1780–17FF) * Mongolian (Unicode block), Mongolian (1800–18AF) * Unified Canadian Aboriginal Syllabics Extended (18B0–18FF) * Brahmic scripts, Brahmic scripts: ** Limbu (Unicode block), Limbu (1900–194F) * Tai languages, Tai scripts: ** Tai Le (Unicode block), Tai Le (1950–197F) ** New Tai Lue (Unicode block), New Tai Lue (1980–19DF) ** Khmer Symbols (Unicode block), Khmer Symbols (19E0–19FF) ** Buginese (Unicode block), Buginese (1A00–1A1F) ** Tai Tham (Unicode block), Tai Tham (1A20–1AAF) * Combining Diacritical Marks Extended (1AB0–1AFF) * Languages of Indonesia, Indonesian scripts: ** Balinese (Unicode block), Balinese (1B00–1B7F) ** Sundanese (Unicode block), Sundanese (1B80–1BBF) ** Batak (Unicode block), Batak (1BC0–1BFF) * Lepcha (Unicode block), Lepcha (1C00–1C4F) * Ol Chiki (Unicode block), Ol Chiki (1C50–1C7F) * Cyrillic Extended-C (1C80–1C8F) * Georgian Extended (1C90–1CBF) * Sundanese Supplement (1CC0–1CCF) * Vedic Extensions (1CD0–1CFF) * Latin supplements: ** Phonetic Extensions (1D00–1D7F) ** Phonetic Extensions Supplement (1D80–1DBF) ** Combining Diacritical Marks Supplement (1DC0–1DFF) ** Latin Extended Additional (1E00–1EFF) * Greek Extended (1F00–1FFF) * Unicode symbols, Symbols: ** General Punctuation (2000–206F) ** Superscripts and Subscripts (Unicode block), Superscripts and Subscripts (2070–209F) ** Currency Symbols (Unicode block), Currency Symbols (20A0–20CF) ** Combining Diacritical Marks for Symbols (20D0–20FF) ** Letterlike Symbols (2100–214F) ** Number Forms (2150–218F) ** Arrows (Unicode block), Arrows (2190–21FF) ** Mathematical Operators (Unicode block), Mathematical Operators (2200–22FF) ** Miscellaneous Technical (2300–23FF) ** Control Pictures (2400–243F) ** Optical Character Recognition (Unicode block), Optical Character Recognition (2440–245F) ** Enclosed Alphanumerics (2460–24FF) ** Box Drawing (2500–257F) ** Block Elements (2580–259F) ** Geometric Shapes (Unicode block), Geometric Shapes (25A0–25FF) ** Miscellaneous Symbols (2600–26FF) ** Dingbats (Unicode block), Dingbats (2700–27BF) ** Miscellaneous Mathematical Symbols-A (27C0–27EF) ** Supplemental Arrows-A (27F0–27FF) ** Braille Patterns (2800–28FF) ** Supplemental Arrows-B (2900–297F) ** Miscellaneous Mathematical Symbols-B (2980–29FF) ** Supplemental Mathematical Operators (2A00–2AFF) ** Miscellaneous Symbols and Arrows (2B00–2BFF) * Glagolitic (Unicode block), Glagolitic (2C00–2C5F) * Latin Extended-C (2C60–2C7F) * Coptic (Unicode block), Coptic (2C80–2CFF) * Georgian Supplement (2D00–2D2F) * Tifinagh (Unicode block), Tifinagh (2D30–2D7F) * Ethiopic Extended (2D80–2DDF) * Cyrillic Extended-A (2DE0–2DFF) * Supplemental Punctuation (2E00–2E7F) * CJK characters, CJK scripts and symbols: ** CJK Radicals Supplement (2E80–2EFF) ** Kangxi Radicals (Unicode block), Kangxi Radicals (2F00–2FDF) ** Ideographic Description Characters (Unicode block), Ideographic Description Characters (2FF0–2FFF) ** CJK Symbols and Punctuation (Unicode block), CJK Symbols and Punctuation (3000–303F) ** Hiragana (Unicode block), Hiragana (3040–309F) ** Katakana (Unicode block), Katakana (30A0–30FF) ** Bopomofo (Unicode block), Bopomofo (3100–312F) ** Hangul Compatibility Jamo (3130–318F) ** Kanbun (Unicode block), Kanbun (3190–319F) ** Bopomofo Extended (31A0–31BF) ** CJK Strokes (Unicode block), CJK Strokes (31C0–31EF) ** Katakana Phonetic Extensions (31F0–31FF) ** Enclosed CJK Letters and Months (3200–32FF) ** CJK Compatibility (3300–33FF) ** CJK Unified Ideographs Extension A (3400–4DBF) ** Yijing Hexagram Symbols (Unicode block), Yijing Hexagram Symbols (4DC0–4DFF) ** CJK Unified Ideographs (Unicode block), CJK Unified Ideographs (4E00–9FFF) * Yi Syllables (Unicode block), Yi Syllables (A000–A48F) * Yi Radicals (Unicode block), Yi Radicals (A490–A4CF) * Lisu (Unicode block), Lisu (A4D0–A4FF) * Vai (Unicode block), Vai (A500–A63F) * Cyrillic Extended-B (A640–A69F) * Bamum (Unicode block), Bamum (A6A0–A6FF) * Modifier Tone Letters (A700–A71F) * Latin Extended-D (A720–A7FF) * Brahmic scripts, Brahmic scripts: ** Syloti Nagri (Unicode block), Syloti Nagri (A800–A82F) ** Common Indic Number Forms (A830–A83F) ** Phags-pa (Unicode block), Phags-pa (A840–A87F) ** Saurashtra (Unicode block), Saurashtra (A880–A8DF) ** Devanagari Extended (A8E0–A8FF) ** Kayah Li (Unicode block), Kayah Li (A900–A92F) ** Rejang (Unicode block), Rejang (A930–A95F) * Hangul Jamo Extended-A (A960–A97F) * Brahmic scripts, Brahmic scripts: ** Javanese (Unicode block), Javanese (A980–A9DF) ** Myanmar Extended-B (A9E0–A9FF) ** Cham (Unicode block), Cham (AA00–AA5F) ** Myanmar Extended-A (AA60–AA7F) ** Tai Viet (Unicode block), Tai Viet (AA80–AADF) ** Meetei Mayek Extensions (AAE0–AAFF) * Ethiopic Extended-A (AB00–AB2F) * Latin Extended-E (AB30–AB6F) * Cherokee Supplement (AB70–ABBF) * Meetei Mayek (Unicode block), Meetei Mayek (ABC0–ABFF) * Hangul Syllables (Unicode block), Hangul Syllables (AC00–D7AF) * Hangul Jamo Extended-B (D7B0–D7FF) * Universal Character Set characters#Surrogates, Surrogates: ** High Surrogates (Unicode block), High Surrogates (D800–DB7F) ** High Private Use Surrogates (Unicode block), High Private Use Surrogates (DB80–DBFF) ** Low Surrogates (Unicode block), Low Surrogates (DC00–DFFF) * Private Use Areas, Private Use Area (E000–F8FF) * CJK Compatibility Ideographs (F900–FAFF) * Alphabetic Presentation Forms (FB00–FB4F) * Arabic Presentation Forms-A (FB50–FDFF) * Variation Selectors (Unicode block), Variation Selectors (FE00–FE0F) * Vertical Forms (FE10–FE1F) * Combining Half Marks (FE20–FE2F) * CJK Compatibility Forms (FE30–FE4F) * Small Form Variants (FE50–FE6F) * Arabic Presentation Forms-B (FE70–FEFF) * Halfwidth and fullwidth forms (Unicode block), Halfwidth and Fullwidth Forms (FF00–FFEF) * Specials (Unicode block), Specials (FFF0–FFFF)


Supplementary Multilingual Plane

Plane 1, the Supplementary Multilingual Plane (SMP), contains historic scripts (except CJK ideographic), and symbols and notation used within certain fields. Scripts include Linear B, Egyptian hieroglyphs, and cuneiform scripts. It also includes English reform orthographies like Shavian and Deseret alphabet, Deseret, and some modern scripts like Osage alphabet, Osage, Warang Citi, Adlam alphabet, Adlam, Wancho language, Wancho and Toto language, Toto. Symbols and notations include historic and modern musical notation; Mathematical alphanumeric symbols, mathematical alphanumerics; shorthands; Emoji and other pictographic sets; and game symbols for playing cards, mahjong, and dominoes. , the SMP comprises the following 151 blocks: * Archaic Greek and Other Left-to-right scripts: ** Linear B Syllabary (Unicode block), Linear B Syllabary (10000–1007F) ** Linear B Ideograms (Unicode block), Linear B Ideograms (10080–100FF) ** Aegean Numbers (Unicode block), Aegean Numbers (10100–1013F) ** Ancient Greek Numbers (Unicode block), Ancient Greek Numbers (10140–1018F) ** Ancient Symbols (Unicode block), Ancient Symbols (10190–101CF) ** Phaistos Disc (Unicode block), Phaistos Disc (101D0–101FF) ** Lycian (Unicode block), Lycian (10280–1029F) ** Carian (Unicode block), Carian (102A0–102DF) ** Coptic Epact Numbers (Unicode block), Coptic Epact Numbers (102E0–102FF) ** Old Italic (Unicode block), Old Italic (10300–1032F) ** Gothic (Unicode block), Gothic (10330–1034F) ** Old Permic (Unicode block), Old Permic (10350–1037F) ** Ugaritic (Unicode block), Ugaritic (10380–1039F) ** Old Persian (Unicode block), Old Persian (103A0–103DF) ** Deseret (Unicode block), Deseret (10400–1044F) ** Shavian (Unicode block), Shavian (10450–1047F) ** Osmanya (Unicode block), Osmanya (10480–104AF) ** Osage (Unicode block), Osage (104B0–104FF) ** Elbasan (Unicode block), Elbasan (10500–1052F) ** Caucasian Albanian (Unicode block), Caucasian Albanian (10530–1056F) ** Vithkuqi (Unicode block), Vithkuqi (10570–105BF) ** Linear A (Unicode block), Linear A (10600–1077F) ** Latin Extended-F (10780–107BF) * Right-to-left scripts: ** Cypriot Syllabary (Unicode block), Cypriot Syllabary (10800–1083F) ** Imperial Aramaic (Unicode block), Imperial Aramaic (10840–1085F) ** Palmyrene (Unicode block), Palmyrene (10860–1087F) ** Nabataean (Unicode block), Nabataean (10880–108AF) ** Hatran (Unicode block), Hatran (108E0–108FF) ** Phoenician (Unicode block), Phoenician (10900–1091F) ** Lydian (Unicode block), Lydian (10920–1093F) ** Meroitic Hieroglyphs (Unicode block), Meroitic Hieroglyphs (10980–1099F) ** Meroitic Cursive (Unicode block), Meroitic Cursive (109A0–109FF) ** Kharoshthi (Unicode block), Kharoshthi (10A00–10A5F) ** Old South Arabian (Unicode block), Old South Arabian (10A60–10A7F) ** Old North Arabian (Unicode block), Old North Arabian (10A80–10A9F) ** Manichaean (Unicode block), Manichaean (10AC0–10AFF) ** Avestan (Unicode block), Avestan (10B00–10B3F) ** Inscriptional Parthian (Unicode block), Inscriptional Parthian (10B40–10B5F) ** Inscriptional Pahlavi (Unicode block), Inscriptional Pahlavi (10B60–10B7F) ** Psalter Pahlavi (Unicode block), Psalter Pahlavi (10B80–10BAF) ** Old Turkic (Unicode block), Old Turkic (10C00–10C4F) ** Old Hungarian (Unicode block), Old Hungarian (10C80–10CFF) ** Hanifi Rohingya (Unicode block), Hanifi Rohingya (10D00–10D3F) ** Rumi Numeral Symbols (Unicode block), Rumi Numeral Symbols (10E60–10E7F) ** Yezidi (Unicode block), Yezidi (10E80–10EBF) ** Arabic Extended-C (10EC0–10EFF) ** Old Sogdian (Unicode block), Old Sogdian (10F00–10F2F) ** Sogdian (Unicode block), Sogdian (10F30–10F6F) ** Old Uyghur (Unicode block), Old Uyghur (10F70–10FAF) ** Chorasmian (Unicode block), Chorasmian (10FB0–10FDF) ** Elymaic (Unicode block), Elymaic (10FE0–10FFF) * Brahmic scripts, Brahmic scripts: ** Brahmi (Unicode block), Brahmi (11000–1107F) ** Kaithi (Unicode block), Kaithi (11080–110CF) ** Sora Sompeng (Unicode block), Sora Sompeng (110D0–110FF) ** Chakma (Unicode block), Chakma (11100–1114F) ** Mahajani (Unicode block), Mahajani (11150–1117F) ** Sharada (Unicode block), Sharada (11180–111DF) ** Sinhala Archaic Numbers (Unicode block), Sinhala Archaic Numbers (111E0–111FF) ** Khojki (Unicode block), Khojki (11200–1124F) ** Multani (Unicode block), Multani (11280–112AF) ** Khudawadi (Unicode block), Khudawadi (112B0–112FF) ** Grantha (Unicode block), Grantha (11300–1137F) ** Newa (Unicode block), Newa (11400–1147F) ** Tirhuta (Unicode block), Tirhuta (11480–114DF) ** Siddham (Unicode block), Siddham (11580–115FF) ** Modi (Unicode block), Modi (11600–1165F) ** Mongolian Supplement (Unicode block), Mongolian Supplement (11660–1167F) ** Takri (Unicode block), Takri (11680–116CF) ** Ahom (Unicode block), Ahom (11700–1174F) ** Dogra (Unicode block), Dogra (11800–1184F) ** Warang Citi (Unicode block), Warang Citi (118A0–118FF) ** Dives Akuru (Unicode block), Dives Akuru (11900–1195F) ** Nandinagari (Unicode block), Nandinagari (119A0–119FF) ** Zanabazar Square (Unicode block), Zanabazar Square (11A00–11A4F) ** Soyombo (Unicode block), Soyombo (11A50–11AAF) * Unified Canadian Aboriginal Syllabics Extended-A (11AB0–11ABF) * Brahmic scripts: ** Pau Cin Hau (Unicode block), Pau Cin Hau (11AC0–11AFF) ** Devanagari Extended-A (11B00–11B5F) ** Bhaiksuki (Unicode block), Bhaiksuki (11C00–11C6F) ** Marchen (Unicode block), Marchen (11C70–11CBF) ** Masaram Gondi (Unicode block), Masaram Gondi (11D00–11D5F) ** Gunjala Gondi (Unicode block), Gunjala Gondi (11D60–11DAF) ** Makasar (Unicode block), Makasar (11EE0–11EFF) ** Kawi (Unicode block), Kawi (11F00–11F5F) * Lisu Supplement (11FB0–11FBF) * Tamil Supplement (11FC0–11FFF) * Cuneiform (Unicode block), Cuneiform (12000–123FF) * Cuneiform Numbers and Punctuation (Unicode block), Cuneiform Numbers and Punctuation (12400–1247F) * Early Dynastic Cuneiform (Unicode block), Early Dynastic Cuneiform (12480–1254F) * Cypro-Minoan (Unicode block), Cypro-Minoan (12F90–12FFF) * Egyptian Hieroglyphs (Unicode block), Egyptian Hieroglyphs (13000–1342F) * Egyptian Hieroglyph Format Controls (13430–1345F) * Anatolian Hieroglyphs (Unicode block), Anatolian Hieroglyphs (14400–1467F) * Bamum Supplement (Unicode block), Bamum Supplement (16800–16A3F) * Mro (Unicode block), Mro (16A40–16A6F) * Tangsa (Unicode block), Tangsa (16A70–16ACF) * Bassa Vah (Unicode block), Bassa Vah (16AD0–16AFF) * Pahawh Hmong (Unicode block), Pahawh Hmong (16B00–16B8F) * Medefaidrin (Unicode block), Medefaidrin (16E40–16E9F) * Miao (Unicode block), Miao (16F00–16F9F) * Ideographic Symbols and Punctuation (Unicode block), Ideographic Symbols and Punctuation (16FE0–16FFF) * Tangut (Unicode block), Tangut (17000–187FF) * Tangut Components (Unicode block), Tangut Components (18800–18AFF) * Khitan Small Script (Unicode block), Khitan Small Script (18B00–18CFF) * Tangut Supplement (18D00–18D7F) * Kana Extended-B (1AFF0–1AFFF) * Kana Supplement (Unicode block), Kana Supplement (1B000–1B0FF) * Kana Extended-A (Unicode block), Kana Extended-A (1B100–1B12F) * Small Kana Extension (1B130–1B16F) * Nushu (Unicode block), Nushu (1B170–1B2FF) * Duployan (Unicode block), Duployan (1BC00–1BC9F) * Shorthand Format Controls (Unicode block), Shorthand Format Controls (1BCA0–1BCAF) * Unicode symbols, Symbols: ** Musical notation: *** Znamenny Musical Notation (1CF00–1CFCF) *** Byzantine Musical Symbols (Unicode block), Byzantine Musical Symbols (1D000–1D0FF) *** Musical Symbols (Unicode block), Musical Symbols (1D100–1D1FF) *** Ancient Greek Musical Notation (Unicode block), Ancient Greek Musical Notation (1D200–1D24F) ** Kaktovik Numerals (Unicode block), Kaktovik Numerals (1D2C0–1D2DF) ** Mayan Numerals (Unicode block), Mayan Numerals (1D2E0–1D2FF) ** Mathematical symbols: *** Tai Xuan Jing Symbols (Unicode block), Tai Xuan Jing Symbols (1D300–1D35F) *** Counting Rod Numerals (Unicode block), Counting Rod Numerals (1D360–1D37F) *** Mathematical Alphanumeric Symbols (Unicode block), Mathematical Alphanumeric Symbols (1D400–1D7FF) ** Sutton SignWriting (Unicode block), Sutton SignWriting (1D800–1DAAF) * Latin Extended-G (1DF00–1DFFF) * Glagolitic Supplement (Unicode block), Glagolitic Supplement (1E000–1E02F) * Cyrillic Extended-D (1E030–1E08F) * Nyiakeng Puachue Hmong (Unicode block), Nyiakeng Puachue Hmong (1E100–1E14F) * Toto (Unicode block), Toto (1E290–1E2BF) * Wancho (Unicode block), Wancho (1E2C0–1E2FF) * Nag Mundari (Unicode block), Nag Mundari (1E4D0–1E4FF) * Ethiopic Extended-B (1E7E0–1E7FF) * Mende Kikakui (Unicode block), Mende Kikakui (1E800–1E8DF) * Adlam (Unicode block), Adlam (1E900–1E95F) * Unicode symbols, Symbols: ** Indic Siyaq Numbers (1EC70–1ECBF) ** Ottoman Siyaq Numbers (1ED00–1ED4F) ** Arabic Mathematical Alphabetic Symbols (Unicode block), Arabic Mathematical Alphabetic Symbols (1EE00–1EEFF) ** Game tiles and cards: *** Mahjong Tiles (Unicode block), Mahjong Tiles (1F000–1F02F) *** Domino Tiles (Unicode block), Domino Tiles (1F030–1F09F) *** Playing Cards (Unicode block), Playing Cards (1F0A0–1F0FF) ** Enclosed Alphanumeric Supplement (Unicode block), Enclosed Alphanumeric Supplement (1F100–1F1FF) ** Enclosed Ideographic Supplement (Unicode block), Enclosed Ideographic Supplement (1F200–1F2FF) ** Miscellaneous Symbols and Pictographs (Unicode block), Miscellaneous Symbols and Pictographs (1F300–1F5FF) ** Emoticons (Unicode block), Emoticons (1F600–1F64F) ** Ornamental Dingbats (Unicode block), Ornamental Dingbats (1F650–1F67F) ** Transport and Map Symbols (Unicode block), Transport and Map Symbols (1F680–1F6FF) ** Alchemical Symbols (Unicode block), Alchemical Symbols (1F700–1F77F) ** Geometric Shapes Extended (Unicode block), Geometric Shapes Extended (1F780–1F7FF) ** Supplemental Arrows-C (Unicode block), Supplemental Arrows-C (1F800–1F8FF) ** Supplemental Symbols and Pictographs (Unicode block), Supplemental Symbols and Pictographs (1F900–1F9FF) ** Chess Symbols (1FA00–1FA6F) ** Symbols and Pictographs Extended-A (1FA70–1FAFF) ** Symbols for Legacy Computing (1FB00–1FBFF)


Supplementary Ideographic Plane

Plane 2, the Supplementary Ideographic Plane (SIP), is used for CJK Ideographs, mostly CJK Unified Ideographs, that were not included in earlier character encoding standards. , the SIP comprises the following six blocks: * CJK Unified Ideographs Extension B (20000–2A6DF) * CJK Unified Ideographs Extension C (2A700–2B73F) * CJK Unified Ideographs Extension D (2B740–2B81F) * CJK Unified Ideographs Extension E (2B820–2CEAF) * CJK Unified Ideographs Extension F (2CEB0–2EBEF) * CJK Compatibility Ideographs Supplement (2F800–2FA1F)


Tertiary Ideographic Plane

Plane 3 is the Tertiary Ideographic Plane (TIP). CJK Unified Ideographs Extension G was added to the TIP in Unicode 13.0, released in March 2020. It also is tentatively allocated for Oracle bone script, Oracle Bone script and Small Seal Script. , the TIP comprises the following two blocks: * CJK Unified Ideographs Extension G (30000–3134F) * CJK Unified Ideographs Extension H (31350–323AF)


Unassigned planes

Planes 4 to 13 (planes to in
hexadecimal In mathematics and computing, the hexadecimal (also base-16 or simply hex) numeral system is a positional numeral system that represents numbers using a radix (base) of 16. Unlike the decimal system representing numbers using 10 symbols, hexa ...
): No characters have yet been assigned, or proposed for assignment, to Planes 4 through 13.


Supplementary Special-purpose Plane

Plane 14 ( in hexadecimal) is designated as the Supplementary Special-purpose Plane (SSP). It comprises the following two Unicode block, blocks, : *Tags (Unicode block), Tags (E0000–E007F) *Variation Selectors Supplement (Unicode block), Variation Selectors Supplement (E0100–E01EF) – used to indicate alternate glyphs for characters.


Private Use Area Planes

The two planes 15 and 16 (planes and in hexadecimal) each contain a "Private Use Areas, Private Use Area". They contain blocks named Supplementary Private Use Area-A (PUA-A) and -B (PUA-B). The Private Use Areas are available for use by parties outside ISO and Unicode (private character encoding).


References

{{Unicode navigation Unicode, Plane